Information decomposition of symbolical sequences
نویسندگان
چکیده
Earlier comprehensive mathematical methods were developed for the study of periodicity of continuous and discrete numerical sequences, using Fourier transformation and allowing the definition of the spectral density of a numerical sequence. However, such an application of a Fourier transform demands presentation of a symbolic sequence as a numerical sequence in which the properties of any symbolic text should be displayed unequivocally. The most widely used is the method, including construction from the given symbolic sequence of m symbols consisting of numbers zero and one, and formed according to the law: x(i,j)=1, if the symbol ai occupies a site j, and x(i,j)=0 in all other cases. Here A={a1, a2, ..., am} is the alphabet of a symbolical sequence and m is the size of the alphabet of a symbolical sequence. Then the Fourier transformation is applied to each of the numerical sequences and the Fourier-harmonics are calculated, corresponding to i-type symbols, as well as to matrix structural factors, corresponding to pair correlation of symbols.
منابع مشابه
Information decomposition of symbolic sequences
We developed a non-parametric method of Information Decomposition (ID) of a content of any symbolical sequence. The method is based on the calculation of Shannon mutual information between analyzed and artificial symbolical sequences, and allows the revealing of latent periodicity in any symbolical sequence. We show the stability of the ID method in the case of a large number of random letter c...
متن کاملSymbolical Kernel-Based Reasoning: Its Application to the Rule Extraction in Dictyostelium discoideum DNA
Kernel-based method is one of the promising algorithms in the pattern recognition field [1]. We have extended the algorithm to the symbolical reasoning tasks (we call the algorithm symbolical kernel-based reasoning (SKBR)) [2], and we currently apply the SKBR to the rule extraction in Dictyostelium discoideum the DNA and cDNA databases of which are well established already [3]. Goal of our proj...
متن کاملSome decomposition formulas of generalized hypergeometric functions and formulas of an analytic continuation of the Clausen function
In this paper, using similar symbolical method of Burchnall and Chaundy formulas of expansion for the generalized hypergeometric function were constructed. By means of the found formulas of expansion the formulas of an analytic continuation for hypergeometric function of Clausen is defined. The obtained formulas of an analytic continuation express known hypergeometric Appell function F2 (a; b1,...
متن کاملA Better and Efficient DNA DATA COMPRESSOR by Fusion of Symbolical and ARLE Technique
Data compression is concerned with how information is organized in data. The size and importance of these databases will be bigger and bigger in the future; therefore this information must be stored or communicated efficiently though there are many text compression algorithms, they are not well suited for the characteristics of DNA sequences. There are algorithms for DNA compression which takes...
متن کاملCalculation of diffusion effect for arbitrary pulse sequences.
A method is presented for calculating the nuclear spin magnetization created by an arbitrary number of short radio frequency pulses and of piecewise constant gradient applied in a selected direction. The isotropic diffusion, the transverse and longitudinal relaxations as well as the global transport are taken into account. A thorough analysis of the magnetization density evolution results in an...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003